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Streptococcus pyogenes is a significant pathogen of humans, annually causing over 
700,000,000 infections and 500,000 deaths. Virulence in S. pyogenes is closely linked 
to mobile genetic elements like phages and chromosomal islands (CI). S. pyogenes 
phage-like chromosomal islands (SpyCI) confer a complex mutator phenotype on their 
host. SpyCI integrate into the 5' end of DNA mismatch repair (MMR) gene nnutL, which 
also disrupts downstream operon genes ImrP, ruvA, and tag. During early logarithmic 
growth, SpyCI excise from the bacterial chromosome and replicate as episomes, relieving 
the mutator phenotype. As growth slows and the cells enter stationary phase, SpyCI 
reintegrate into the chromosome, again silencing the MMR operon. This system creates 
a unique growth-dependent and reversible mutator phenotype. Additional CI using the 
identical attachment site in mutL have been identified in related species, including 
Streptococcus dysgalactiae subsp. equisimilis, Streptococcus anginosus, Streptococcus 
intermedius, Streptococcus parauberis, and Streptococcus canis. These CI have small 
genomes, which range from 13 to 20 kB, conserved integrase and DNA replication 
genes, and no identifiable genes encoding capsid proteins. SpyCI may employ a helper 
phage for packaging and dissemination in a fashion similar to the Staphylococcus aureus 
pathogenicity islands (SaPI). Outside of the core replication and integration genes, SpyCI 
and related CI show considerable diversity with the presence of many indels that 
may contribute to the host cell phenotype or fitness. SpyCI are a subset of a larger 
family of streptococcal CI who potentially regulate the expression of other host genes. 
The biological and phylogenetic analysis of streptococcal chromosomal islands provides 
important clues as to how these chromosomal islands help S. pyogenes and other 
streptococcal species persist in human populations in spite of antibiotic therapy and 
immune challenges. 
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INTRODUCTION 

Streptococcus pyogenes is a significant human pathogen, annu- 
ally causing over 700,000,000 infections and 500,000 deaths 
(Carapetis et al., 2005). Genome sequencing has revealed that 
prophages and other mobile genetic elements are important 
features of Streptococcus pyogenes (group A streptococcus) chro- 
mosomes, sometimes contributing up to 10% of the total DNA 
(Desiere et al., 2001; Ferretti et al, 2001; Banks et al., 2002; 
Canchaya et al., 2002). These genome prophages follow a typ- 
ical lambdoid gene arrangement, with sequentially organized 
modules for integration and lysogeny, DNA replication, tran- 
scriptional regulation, DNA packaging and head assembly, tail 
and tail fiber assembly, and lysis (Desiere et al, 2001; Banks et al., 
2002; Canchaya et al, 2002; Brussow et al, 2004). In S. pyo- 
genes and many other pathogens, these essential phage genes are 
often followed by one or more virulence genes such as toxins 



(Brussow et al, 2004). Numerous genes on S. pyogenes chromo- 
somes are the targets for site-specific integration by these mobile 
genetic elements, and for some of these genes, integration has the 
potential to interrupt or alter their transcription (McShan and 
Ferretti, 2007). Of these targeted genes, one location stands out 
both for its frequency of occupation by a chromosomal island 
(CI) as well as the potential phenotypic impact integration would 
have on the cell: the operon encoding the genes for DNA mis- 
match repair (MMR). We have characterized phage-like CI in 
S. pyogenes that integrate into MMR gene mutL, silencing this 
gene and the other downstream genes of the operon (Scott et al., 
2008, 2012). The integration of the S. pyogenes Chromosomal 
Island Ml (SpyCIMl) into the chromosome induces a com- 
plex mutator phenotype that results from the interruption of 
the operon and downstream DNA repair genes (Scott et al, 
2008, 2012). Largely due to the extensive and ongoing efforts to 
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sequence the genomes of many species of bacteria, additional 
islands integrated into mutL also have been identified in other 
Streptococcus species. This review wiU examine the CI identified 
so far, their known or potential impacts on host phenotype and 
survival, and implications for the evolution of this group and their 
host bacteria. A note concerning nomenclature: when referred to 
collectively as a group, S. pyogenes phage-like CI are referred to 
as SpyCI, but a specific CI is identified so to associated it with a 
particular strain or isolate (e.g., SpyCIMl, SpyCIM49, etc.). The 
same convention is applied to CI from other streptococcal species. 

SpyCIMl AND THE HOST MUTATOR PHENOTYPE 

Typically, site-specific recombination occurs between bacterial 
and phage genomes such that the transcription of the targeted 
host gene is unimpeded by the presence of the prophage (Fouts, 
2006). This maintenance of gene function is accomplished by two 
factors: (1) duplication of the host DNA sequence at the site of 
crossover by a portion of the phage DNA and (2) integration at 
the 3' end of the targeted gene so that the duplication can com- 
plete the original bacterial ORF (Fouts, 2006; Louie et al, 2007; 
McShan and Ferretti, 2007). By contrast, integration into the 5' 
end or the middle of a gene could result in the disruption of 
normal transcription with a concomitant loss of gene function. 
Occasional examples of prophages altering the expression of host 
genes have been reported in Escherichia coli and Staphylococcus 
aureus (Mason and Allen, 1975; Lee and landolo, 1986; Thomas 
and Drabble, 1986; Coleman et al, 1991; Campbell et al, 1992), 
but these occurrences have been notable in part because of their 
rarity. By contrast, genome sequencing has revealed that S. pyo- 
genes prophages frequently target attachment sites positioned 
at the promoter or 5' end of genes that, following integration, 
potentially could alter gene expression, or create polar mutations 
(McShan and Ferretti, 2007). Of these mobile genetic elements 
with the potential to alter gene expression, the phage-like chro- 
mosomal island that frequently targets and regulates the DNA 
mismatch repair (MMR) operon of S. pyogenes is perhaps the 
most remarkable. 

MMR has been extensively studied in E. coli where the sys- 
tem recognizes base pair mismatches in nascent, hemimethylated 
DNA and directs strand-specific repair. The newly synthesized 
unmethylated DNA strand, which contains the mismatch, is 
cleaved by the MMR system and repair is initiated by re-synthesis 
of the cleaved region. Gram-positive bacteria and eukaryotes 
do not rely upon methylation for strand recognition, probably 
instead relying upon modification of the beta clamp of DNA 
polymerase III for strand discrimination (Li, 2008). LeClerc and 
co-workers were the first to observe that the mutator pheno- 
type was present in wild populations of E. coli and Salmonella 
enterica at unexpectedly high frequencies and that these phe- 
notypes mapped to mutations in MMR genes (LeCIerc et al, 
1996). It was subsequently found that mutators were present in 
both pathogenic and non-pathogenic E. coli, suggesting that this 
trait conferred a selective advantage upon the cell in spite of 
the risk of increased frequencies of deleterious mutations (Matic 
et al., 1997). Subsequent studies showed that MMR mutants 
are frequently isolated from clinical strains of many species. 
For example, antibiotic treatment of Pseudomonas aeruginosa 



infections in cystic fibrosis patients correlates with the rapid 
appearance of drug-resistant MMR mutants (LeClerc and Cebula, 
2000; Oliver et al., 2002). Other examples have been found in 
Neisseria meningitis, Helicobacter pylori, Haemophilus influenzae, 
and Staphylococcus aureus (Bjorkholm et al, 2001; Richardson 
et al., 2002; Bayliss et al, 2004; Prunier and Leclercq, 2005; Trong 
et al., 2005). The frequency of such mutator strains is often 
high: for example, 20% of P. aeruginosa strain from cystic fibro- 
sis patients and over 50% of epidemic-associated serogroup A 
N. meningitis strain are mutators (Oliver et al., 2002; Richardson 
et al, 2002). The MMR system can act as a barrier for genetic 
diversity and bacteriophage transduction, thus inhibition of 
MMR removes this barrier and promotes diversification through 
homeologous recombination (Limia et al, 1998; Kataja et al., 
1999; Matic et al., 2000). However, in all of these species, the 
MMR defects result from mutations that render mutS or mutL 
permanently defective. The resulting mutator phenotype is a 
double-edged sword, however. The advantages gained by a cell 
in rapidly acquiring favorable mutations like antibiotic resistance 
are balanced by the possibility of deleterious mutations arising 
that diminishes cell viability. In S. pyogenes, a remarkable solution 
has evolved to achieve a mutator phenotype while minimizing the 
potential risks: a growth-dependent molecular switch controlled 
by a CI, which allows the cells to be phenotypicaUy wild type when 
resources are abundant but switching to a mutator phenotype 
when facing challenges like limited nutrient availability. 

The MMR operon of S. pyogenes Ml strain SF370 (Figure 1) 
is composed of the genes mutS, mutL, ImrP, ruvA, and tag, which 
encode MMR, a multidrug efflux pump of the major facilitator 
family, a HoUiday junction resolvase, and base excision repair 
glycosylase, respectively, (Ferretti et al, 2001). These genes are 
grouped on a polycistronic mRNA that is controlled by a sin- 
gle promoter upstream of mutS. Analysis of the Ml genome 
showed that the phage-like chromosomal island SpyCIMl was 
integrated between mutS and mutL, and this integration was sub- 
sequently found to interrupt the expression of mutL and the 
downstream genes (Scott et al., 2008, 2012). SpyCIMl is not a 
static element, permanently residing in the bacterial genome like a 
typical prophage, but rather a dynamic element that excises from 
chromosome during early logarithmic growth and replicates as a 
circular episome (Scott et al, 2008). As the bacterial population 
reaches the end of logarithmic phase and enters stationary phase, 
SpyCIMl re-integrated into its unique attachment site {attB) at 
the beginning of mutL (Figure 1, insert). Thus, SpyCIMl acts as a 
growth-dependent molecular switch to control the expression of 
MMR. The outcome of this switch is that the S. pyogenes cell alter- 
nates between a mutator and wild type phenotype in response 
to grovrth: during rapid cell division and DNA replication, the 
integrity of the genome is maintained by an active MMR system 
while during stationary phase or other periods of infrequent cell 
division, mutations may accumulate at a higher rate. 

This molecular switch controls additional operon genes down- 
stream of mutL. The next gene, ImrP, encodes a putative mul- 
tidrug resistance efflux pump (MDR) of the major facilitator 
family (Bolhuis et al, 1995; Putman et al., 2001). In Lactococcus 
lactis, the gene was characterized as an ATP dependent pump that 
extrudes multiple drugs across the membrane, preventing toxic 
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FIGURE 1 I SpyCIMI regulates the MMR operon through dynamic 
site-specific excision and integration. Tine molecular switch controlled 
by SpyCIMI is shown. The MMR operon of S. pyogenes is comprised of 
genes encoding DNA mismatch repair {mutS and mutL), multidrug efflux 
(ImrP), a Holliday-junction resolvase [ruvA), and base excision repair (tag). 
During exponential phase, SpyCIMI excises from the chromosome, 
circularizes, and replicates as an episome, restoring transcription of the 
entire MMR operon (WT). Excision and mobilization occurs early in 
logarithmic growth in response to yet unknown cellular signals (Insert; 



adapted from Scott et al., 2008). As logarithmic growth continues, 
SpyCIMI re-integrates into mutL at attB, and by the time the culture 
reaches stationary phase, the integration process has completed, again 
blocking transcription of the MMR operon. WT Wild type phenotype 
associated with unimpeded expression of the MMR operon. Color key of 
predicted gene functions: Green, genes of unknown function; red, 
possible toxin-antitoxin maintenance genes; light blue, DNA replication; 
dark blue, control of lysogeny; pink, transmembrane peptide; orange, 
site-specific integrase. 



accumulations of these chemicals in the bacteria (Bolhuis et al., 
1995). Presently, the natural substrate for LmrP in S. pyogenes is 
unknown, as is why a multidrug efflux pump is transcriptionally 
linked with a DNA repair operon in group A and related species 
of streptococci (Supplemental Table 1). However, the ability to 
regulate expression of this gene may have selective advantage for 
the streptococcus. In Listeria monocytogenes an LmrP homolog 
(mdrM) exists whose expression or inhibition can control the 
magnitude of the host cytosolic response to infection, and loss 
of MdrM protein function leads to a 3 -fold reduction in IFN- 
P response to infection (Crimmins et al., 2008). It may be that 
inhibition of LmrP expression in S. pyogenes similarly provides a 
mechanism of innate immunity avoidance. Indeed, if true, SpyCI 
regulation of the LmrP in S. pyogenes may have a large biological 
impact given how a MDR can influence multiple processes in a 
cell by removal of toxic or inhibitory substances. 

The next gene on the polycistronic message, ruvA, encodes 
an ATP-dependent helicase that promotes branch migration of 
HoUiday junctions during homologous genetic recombination 
and recombinant repair of damaged DNA. The loss of RuvA func- 
tion leads to increased sensitivity to UV damage as irradiation 
induced DNA lesions lead to arrested replication forks (Iwasaki 
et al, 1989; Tsaneva et al, 1992; Kaplan and O'Donnell, 2006). 
This increase in sensitivity to UV irradiation is clearly observed 
in S. pyogenes strains carrying SpyCI (Scott et al., 2008, 2012). 

The last gene on the operon, tag, encodes a 3-methyladenine 
DNA glycosylase I which is involved in base excision repair 
(BjeUand et al., 1993). This enzyme is important in recognizing 



and purging aberrant and modified bases from damage induced 
by DNA damaging elements such as the alkylating agent 
ethyl methanesulfonate (Wyatt et al., 1999). Loss of the 
3-methyladenine DNA glycosylase greatly increases the sponta- 
neous mutation rate associated with single nucleotide substitu- 
tion (Kaasen et al, 1986; BjeUand et al., 1993; Wyatt et al, 1999). 
The loss of gene expression from mutL to tag causes the cell to 
exhibit a complex mutator phenotype that impacts several DNA 
repair or maintenance systems (Scott et al, 2008, 2012). Indeed, 
the silencing of this operon may necessarily need to be reversed 
occasionally to maintain cell viability, given our observation that 
permanent loss of the ability to excise from the bacterial chromo- 
some lead to the use of a new promoter to express these genes in 
M5 strain Manfredo (Scott et al, 2012). 

SpyCI are frequent genetic elements in S. pyogenes genomes 
(Table 1).M serotypes associated with SpyCI carriage currently 
include Ml, M2, M4, M5, M6, M18, M25, M28, M31, M37, M49, 
M53, M59, M78, and M123 (Scott et al, 2008, 2012; Suvorov 
et al., 2009). Currently, it is not known whether SpyCI infect only 
a subset of S. pyogenes serotypes, perhaps defined by surface tar- 
gets for phage attachment, or whether most serotypes may serve 
as SpyCI hosts and the current sample size is simply too small. 
Other factors such as phage immunity proteins or the depen- 
dence of SpyCI on helper phages with limited host ranges may 
also influence the dissemination of these chromosomal islands. 
At least from the standpoint of integration, virtually all group A 
streptococci could serve as a host for SpyCI since the attB DNA 
sequence at the beginning of mutL is highly conserved. 
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In general, the presence of a SpyCI in a given S. pyogenes 
strain correlates with a higher mutation rate and UV sensitiv- 
ity when compared to strains lacking this chromosomal island 
(Scott et al., 2012). Different SpyCI+ strains do show a range of 
mutation rates, however (Figure 2). When compared to SpyCI- 
free strain NZ131, genome strains with the chromosomal island 
showed mutation rates that ranged between 5 and 167 times 
higher. Similarly, resistance to UV irradiation also showed a 
range of sensitivities. This strain-to-strain variation may reflect 
differences in SpyCI regulation that determine whether the chro- 
mosomal island tends to be integrated into mutL or excised 
as an episome (i.e., how frequently the MMR operon is tran- 
scriptionally active). Variations in the operator controlling the 
repressor and antirepressor may play a role in this decision to 



Table 1 | S. pyogenes strains with SpyCI Integrated into DNA MMR 
gene mutL that have been identified by genome sequencing. 



Strain 


Serotype 


Chromosomal 


GenBank 






island 


Accession 


SF370 


Ml 


SpyCIMI 


AE004092 


MGAS10270 


M2 


SpyCIM2 


CP000260 


MGAS10750 


M4 


SpyCIM4 


CP000262 


Manfredo 


M5 


SpyCIMB 


AM295007 


GA40377 


M5 


SpyCIM5 GA40377 


AWTK01 000000 


GA41046 


M5 


SpyCIM5 GA41046 


AWWB01 000000 


UTMEM-1 


M5 


SpyCI UTMEM-1 


NZ_AVCFOOOOOOOO 


UTSW-2 


M5 


SpyCI UTSW-2 


NZ_AVCGOOOOOOOO 


MGAS10394 


M6 


SpyCIM6 


CP000003 


GA19700 


M6 


SpyCIM6 GA19700 


AWTF01000000 


GA41039 


M6 


SpyCIM6 GA41039 


AWTG01000000 


GA41208 


M6 


SpyCIM6 GA41208 


AWTH01000000 


T253 


M25 


SpyCIM25 


(Unpublished) 


MGAS 6180 


M28 


SpyCIM28 


CP000056 


GA03747 


M49 


SpyCIM49 GA03747 


AWUB01 000000 


GA16797 


M49 


SpyCIM49 GA16797 


AWUD01 000000 


GA40634 


M49 


SpyCI GA40634 


NZ_AURU00000000 


Alab49 


M53 


SpyCIM53 


CP003068 


MGAS1882 


M59 


SpyCIM59 


CP003116 


MGAS15252 


M59 


SpyCIM59.1 


CP003121 



remain integrated or extrachromosomal as well as variations 
in other DNA repair genes that affect the overall cell muta- 
tion rate (Scott et al, 2012). The one exception to this general 
trend was found in serotype M5 strain Manfredo, which has a 
128 bp deletion in the SpyCI integrase gene that renders it inac- 
tive but a mutation rate that was a 1000-fold lower than NZ131 
(Figure 2). So, in spite of the fact that SpyCIMS was perma- 
nently integrated into the Manfredo chromosome, this strain was 
wild type for the MMR operon. This paradox was resolved by 
the discovery of a novel promoter within the SpyCIMS integrase 
pseudogene that rescued the expression of mutL and the down- 
stream genes. Interestingly, expression from this novel promoter 
was depressed by mitomycin C treatment, which was in con- 
trast to the activation of the MMR in SF370 and other strains 
with a SpyCI capable of excision from mutL. It remains unknown 
whether this apparent mechanism of gene expression control is 
the result of natural selection or merely a circumstantial byprod- 
uct of evolution of this compensatory promoter (Scott et al., 
2012). 

Comparisons of mutation rates between strains possessing 
or lacking SpyCI is informative, but the conclusions are infer- 
ential since other genes not in this operon may influence the 
observed final phenotype. Therefore, to directly assess the effect of 
SpyCIMI carriage alone, the chromosomal island was cured from 
the Ml SF370 genome to create an isogenic derivative, and com- 
parisons of the two strains show that island integration is respon- 
sible for a 200-fold increased mutation rate, increased sensitivity 
to ethidium bromide, increased UV irradiation sensitivity, and 
higher rates of single point mutations (Euler et al, submitted). 
As the island excises in response to growth, this dynamic regula- 
tion of the operon allows the organism to maintain genetic fidelity 
in optimal conditions while selectively increasing mutation rates 
during stressful conditions (Scott et al., 2008). 

CONSERVATION AND DIVERSITY OF SpyCI GENES 

SpyCI and other phage-like chromosomal islands may have orig- 
inated from defective prophages, but their biology suggests an 
even more complex origin. The defining gene of SpyCI is the 
integrase (int), whose expression and regulation controls the 
molecular switch for the MMR operon. Within the known SpyCI, 
int is highly conserved at both the gene and protein level (Scott 
et al., 2008, 2012). The SpyCI integrase genes form a distinct 
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FIGURE 2 I The mutation rates of SpyCI'*' S. pyogenes genome and 
clinical strains. Strains harboring SpyCI integrated into mutL have 
phenotypes that range from non-mutator in Manfredo, where a novel CI 
promoter in the defective integrase gene rescues expression of mutL, to a 



hypermutator in M6 genome strain MGAS10394. Strain NZ131 does not carry 
a SpyCI and is presented as a wild type strain with regards to the MMR 
operon. The mutation rate is calculated as mutations per generation. The 
figure was drawn using the data from Scott et al. (2012). 
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group within the S. pyogenes prophage integrases (McShan, 2005), 
although the integrase encoded by S. pyogenes strain NZ131 
prophage NZ131.1 is 62% identical to the SpyCIMl integrase at 
the amino acid level, suggesting that these genes may have had a 
recent common origin (McShan et al., 2008). The integration site 
for prophage NZ131.1 is near the promoter of hypothetical pro- 
tein Spy49_0371 (McShan et al, 2008), which is unrelated to the 
MMR operon. 

Typical for prophages and phage-like CI, two genes encod- 
ing predicted DNA-binding proteins are found upstream of int 
and arranged in opposite orientations flanking a probable oper- 
ator site. These genes, which encode the predicted repressor and 
antirepressor, are the likely candidates for the control of SpyCI 
integration and excision. Excision of SpyCI can be induced by 
mitomycin C treatment (Scott et al., 2008, 2012), suggesting that 
the mostly uncharacterized S. pyogenes SOS DNA repair pathway 
can trigger SpyCI excision and may lead to packaging by a helper 
prophage for dissemination to new host cells (Nguyen, unpub- 
lished observations). However, the cellular signals that induce 
the normal cycle of SpyCI excision and re-integration during 
growth, which presumably is triggered by repressor cleavage, are 
yet unknown. Interestingly, these genes and their operators vary 
between the different SpyCI, and these differences may contribute 
to the range of mutation rates seen between the different S. pyo- 
genes genome strain that harbor these CI (Scott et al., 2012). That 
is, SpyCI repressors that are more sensitive to cleavage would 
favor the episomal form of the SpyCI and the appearance of the 
wild type phenotype with respect to MMR while repressors that 
are more stable would favor integration and the mutator phe- 
notype. The operator DNA sequences of SpyCIMl, SpyCIM2, 
SpyCIM28, and SpyCIM53 are all nearly identical while the 
lysogeny module of the hypermutator M6 strain MGAS 10394 
is essentially the same as the emerging hypervirulent strain 
MGAS15252 (Fittipaldi et al, 2012; Scott et al, 2012). As we 
previously pointed out, the SpyCI controlled mutator phenotype 
in MGAS15252 may contribute to the many striking pheno- 
typic changes that are associated with this strain, which include 
enhanced transmission by skin contact, significantly impaired 
ability to grow in saliva, and a tendency not to colonize the 
oropharynx (Scott et al., 2012). 

Other SpyCI genes are conserved, forming part of the core 
set of genes that identify these mobile genetic elements. The pre- 
dicted primase and replicase genes (Figure 1) are closely related to 
homologs from Streptococcus thermophilus plasmid pStl06, which 
are essential for plasmid replication (Geis et al, 2003). While the 
molecular details of these genes in SpyCI DNA replication and 
extrachromosomal maintenance remain to be determined, their 
high degree of conservation within these CI argues their essential 
role (Scott et al, 2008), presumable to ensure DNA replication 
of episomal SpyCI. Other SpyCI genes show various degrees of 
conservation. One universally conserved gene, which was not 
originally annotated due to its small size, is positioned immedi- 
ately upstream of int and encodes a small transmembrane domain 
peptide of unknown function (Figure 1). If this ORF really repre- 
sents an expressed gene, then its product has the potential to alter 
the surface properties of the streptococcal cell, potentially alter- 
ing its antigenicity, functioning as an environmental sensor, or 



providing a protection mechanism against lytic phages by inter- 
fering with their attachment. Other genes, based upon homology 
with known homologs, may function like a toxin-antitoxin pair 
to prevent SpyCI eliminations (labeled fox and atx in Figure 1). 
None of the SpyCI-encoded genes include an identifiable DNA 
polymerase or a terminase subunit, which plays an important role 
in the helper phage packaging process of the Staphylococcus aureus 
pathogenicity islands (SaPI) (Novick et al, 2010). Most of the 
remaining ORFs in the SpyCI encode products of unknown func- 
tion, and much variation exists between the individual members 
(Scott et al., 2008). The driving forces behind the genetic diversity 
of SpyCI from different S. pyogenes isolates are poorly understood 
at present. 

MISMATCH REPAIR CHROMOSOMAL ISLANDS OF OTHER 
STREPTOCOCCUS SPECIES 

The transcriptional linking of mutS and mutL is not uni- 
versal in prokaryotes, although analysis of publicly available 
genomes shows that many streptococcal species do group these 
genes as a unit. This group includes, in addition to S. pyo- 
genes. Streptococcus mutans. Streptococcus equi subspecies equi 
and subsp. zooepidemicus. Streptococcus agalactiae. Streptococcus 
uteris, and Streptococcus thermophilus (Table 2). Interestingly, 
Streptococcus pneumoniae does not follow this pattern, encoding 
MMR genes hexA and hexB at distant sites on its chromosome. 
Genes ruvA and tag are positioned near hexB on the S. pneu- 
moniae chromosome and are probably co-transcribed with each 
other but not with hexB. The gene composition of the operon 
found in S. pyogenes and other related streptococci may repre- 
sent an instance where evolution has selected for an arrangement 
that simplifies the control of expression for several housekeep- 
ing genes. This arrangement, however, has allowed mobile genetic 
elements like the SpyCI to assume a unique regulatory role. 

Using the SpyCIMl integrase gene as the query, a TBLASTN 
search (Altschul et al, 1997) of the available complete or partial 
bacterial genomes revealed many related islands in Streptococcus 
species that are integrated into the same attachment site in 
mutL, including Streptococcus anginosus. Streptococcus inter- 
medins. Streptococcus dysgalactiae subsp. equisimilus. Streptococcus 
canis, and Streptococcus parauberis (Table 2). In these species, 
the CI may potentially regulate the MMR operon much like 
how SpyCIMl does. Ignoring the defective SpyCIMS integrase 
that has a 128 bp deletion in the gene (Scott et al., 2012), these 
integrases share at least 64.0% amino acid sequence identity 
(Supplemental Figure 1). Phylogenetic tree analysis of the inte- 
grases shows close similarity as well with little sequence distance 
(Figure 3). Perhaps this is not surprising, as these islands have 
conserved core sequences required for integration into strepto- 
coccal mutL, which itself provides a conserved target (Figure 3, 
insert). However, despite strong conservation of the integrase, 
other regions of these CI show considerable genetic diversity 
between species. Genomic alignment of the islands revealed four 
distinct groups within the MMR islands (Figure 4). The diversity 
and frequency of these islands is remarkable with chromosomal 
islands found in streptococcal species associated with human dis- 
ease, streptococcosis in flounder and dairy bovine mastitis (Nho 
et al, 2011; Lefebure et al, 2012). 
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Table 2 | Chromosomal islands identified in other streptococcal species that target mutL or other genes for integration. 



Species 


Strain 


CI 


attB 


Gene target 


GenBanIc accession 


S. dgdidctide 


Al CL^ 1 oo lo 


oagui 


rpsD 


J 


AbUUU lUUUUUo 


S. anginosus 


1505 


SanCI 


mutL 


5' 


NZ_DAbVVUUUL)L)L)L)L) 


S. anginosus 




oanui 


mutL 


0 


L^rUUooo 1 


S. anginosus 


rVZ \ 1 


oanui 


mutL 


0 


A c r'Tn r\r\r\r\r\r\r\ 
AbU 1 UUUUUUUU 


S. anginosus 


J4206 


SanCI 


mutL 


5' 


KLbl /o/U 


S. anginosus, subsp. wliileyi 


L-LUb oy 1 by 


banui 


mutL 


5' 


AlCrUUUUUUUU 


S. anginosus, subsp. wliileyi 


MAbDz4 


SanCI 


mutL 


5' 


ArUl JU/z 


S. can is 


rbL /.S-ZZ/ 


ScaCI 


mutL 


5' 


Ni"7 A \r\\/r\r\r\r\r\r\r\r\ 

NZ_AIUaUUUUUUUU 


S. dysgalactiae, subsp. equisimilis 


167 


SeqCI 


mutL 


5' 


Ar012976 


S. dysgalactiae, subsp. equisimilis 


SKI 249 


SeqCI 


mutL 


5' 


AhlNOl 000000 


S. intermeaius 


Al LL 2/335 


SinCI 


mutL 


5' 


NZ_AI l-KOOOOOOOO 


S. intermedius 


F0413 


SinCI 


mutL 


5' 


ArXUUUUUUUUU 


S. intermedius 


JTH08 


SinCI 


mutL 


5' 


Aruiuyby 


S. intermedius 


SK54 


SinCI 


mutL 


5' 


NZ_AJKNUUUUUUUU 


S. mitis 


1 I/O 


oiniui 


uvrA 


0 


A MTTm nnnnm 
AU 1 1 U lUUUUU 1 


S. mitis 


DO 


bmiLl 


manA 


5' 


CM nconco 


S. mitis 


SK569 


bmiLI 


urvA 


5' 


Nz_Ar U rU 1 UUUUUU 


S. mitis 


ol\D lb 


omiui 


urvA 


0 


AIL-nUUUUUUUU 


S. mitis 


olx lU /o 


Qi-viir*i 
bmiui 


uvrA 


0 


A c/^Tm nnnnni 
ArU 1 U lUUUUU i 


S. mitis 


olx 1 Ut5U 


omiui 


uvrA 


0 




S. parauberis 


Kl^ 1 U 1 1 Oo / 


opaui 


mutL 


O 


Ljruuz4/ 1 


S. parauberis 


Kno-UZUoo 


opaui 


mutL 


0 


M7 A 1 Vh/innnnnnnn 

lNZ._ALl IVIUUUUUUUU 


S. parauberis 


l\no-Uz lUy 


bpaui 


mutL 


0 


Nz_ALVVnUUUUUUUU 


S. pneumoniae 


zU/ IUU4 


bpnLl 


uvrA 


5' 


ALdJUUUUUUUU 


S. pneumoniae 


/UoOblci 


SpnCI 


uvrA 


5' 


Al Di r\ r\ r\ r\ r\ r\ r\ r\ 

ALdLUUUUUUUU 


S. pneumoniae 


GA14688 


SpnCI 


uvrA 


5' 


AIKQ01000003 


S. pneumoniae 


GA17719 


SpnCI 


uvrA 


5' 


AILO01000005 


S. pneumoniae 


Hungary 19A-6 


SpnCI 


uvrA 


5' 


NC_010380 


S. pseudo-pneumoniae 


ATCC BAA-960 


SpsCI 


uvrA 


5' 


AlCSOlOOOOOl 


S. species 


M334 


CI 


uvrA 


5' 


PRJNA62529 


S. suis 


D9 


SsuCI 


recF 


5' 


NC_017620 


S. thermophilus 


CNRZ1066 


SthCI 


metE 


5' 


NC_006449 



Except for SagCI integration into the 3' end of rpsD, ali CI listed integrated into the 5' end of their targeted genes and thus could alter their transcription. 

Gene key: manA — alpha-1 ,2-mannosidase; metE — methionine synthase; mutl^DNA MMR protein MutL; recF— recombination protein RecF; rpsD — 30S ribosomal 

protein S4; uvrA — UvrABC endonuclease subunit. 



The MMR operon, as organized in S. pyogenes, is present in 
many other species of the genus streptococcus. However, there is 
one major differentiating characteristic in this operon that creates 
a division between these other species: the presence or absence of 
the gene for MDR LmrP (Figures and Supplemental Table 1). 
Groups A, B, C, and G streptococci have ImrP inserted between 
mutL and ruvA as do a number of other species including S. iniae, 
S. uberis, and S. parauberis. While S. mutans and most viridans 
streptococci do not have ImrP, a few, like S. oralis, do. The pres- 
ence or absence of this MDR gene, as well as its regulation by a 
chromosomal island, raises some interesting biological questions. 
Why is a gene for a drug efflux pump transcriptionally linked 
to DNA repair genes? Further, what is to be gained, if anything, 
by the cell through inhibition of expression of this gene? Indeed, 
why is this gene dispensable in some species but present in oth- 
ers? The answers to these questions will come as the role of this 
MDR protein in the biology and virulence of S. pyogenes and 



other species is determined. SpyCI-like CI that target mutL usu- 
ally are found in species that have ImrP; however, S. anginosus and 
S. intermedius, both members of the mOleri group that lack ImrP, 
have acquired CI that integrate into this gene. 

In addition to CI identified by genome sequencing, another 
S. anginosus CI was identified from a clinical source. A dap- 
tomycin resistant strain of S. anginosus (strain J4206) was iso- 
lated from a patient with bacteremia and septic shock (Palacio 
et al., 2011) and was found to have a CI integrated into mutL. 
This CI (SanCI 14206) has been sequenced in our labora- 
tory and its impact on the host mutator phenotype assessed 
(manuscript in preparation). Daptomycin resistance results from 
multistep genetic changes and is thought to be rare (Tran et al., 
2013). As the related SpyCI are known to confer a mutator 
phenotype in their host, SanCI J4206 could putatively con- 
tribute to the emergence of daptomycin resistance in S. anginosus 
through hypermutability. Previously, daptomycin resistance has 
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FIGURE 3 I Phylogenetic tree of the streptococcal CI integrases that 
target MMR gene mutL. Analysis of the chromosomal island integrases 
amino acid sequences that target mutL was used to construct a phylogenetic 
tree showing the four known major groups. The insert shows the consensus 



alignment of the mutL. Strain details are given in Tables 1, 2. The proteins 
encoded by the CI integrase genes were aligned and the phylogenetic tree 
created using Geneious v. 6.1.7 (Drummond et al., 2012). The consensus of 
the mutL attachment site was created using WebLogo (Crooks et al., 2004). 



not observed in an in vitro study of 106 S. anginosus isolates 
(Streit et al., 2005). 

It is possible that SpyCI, as well as other Gram-positive phage- 
like chromosomal islands (Novick et al., 2010), have a complex 
evolutionary history and their genetic material may have orig- 
inated from disparate sources. While each chromosomal island 
shows considerable diversity (Scott et al, 2008), several genes, 
notably the integrase, primase, and replicase genes, are highly 
conserved, providing clues to the minimal genome composition 
needed for a functional CI. 

STREPTOCOCCAL CI WITH OTHER GENE INTEGRATION 
TARGETS 

Streptococcal chromosomal islands have been identified by 
genome sequencing that target genes other than mutL (Table 2 
and Figure 6). Although the biological impact of integration into 
any of these genes is not yet known, some predictions may be 
made based upon whether the 5' or 3' end of the ORF is the point 
of site-specific recombination. For example, in three of the nine 
currently available genomes of S. agalactiae, a SagCI is found inte- 
grated into rpsD, which encodes the 30S ribosomal protein S4. 
Given the essential role of this protein in ribosome function, it is 
perhaps no surprise that this chromosomal island integrates into 
the 3' end of the rpsD ORF so transcription is unimpeded. Other 
genes, by contrast, could be regulated in their expression patterns 
to the benefit of the cell, at least under certain environmental or 
physiological conditions. For example, the chromosomal island 
from the B6 strain of Streptococcus mitis (SmiCIB6) integrates into 
the 5' end of the gene encoding alpha-l,2-mannosidase [manA) 
while the Streptococcus thermophilus CI (SthCI) integrates into 



metE, encoding methionine synthase. In both cases, a SpyCI- 
like switch could activate or silence these genes to optimize host 
fitness. Regulation of metE expression could alter the relative 
intracellular levels of homocysteine and methionine (Matthews 
et al., 1998), which under some conditions might be favorable 
to the cell. Similarly, S. mitis B6 may only need the action of 
alpha- 1,2-mannosidase when a need arises to use glycan or glyco- 
proteins as a carbon source as observed in other oral streptococci 
(Tarelh et al, 1998). 

The S. mitis SmiCI provide an interesting glimpse into the evo- 
lution and diversification of phage-like CI. As discussed above, 
SmiCI B6 integrates into manA. However, the related SmiCI from 
other S. mitis strains as well as the SpnCI from S. pneumoniae 
target the probable operator controlling the expression of the 
gene (uvrA) encoding the A subunit of the UvrABC excinucle- 
ase that is a key component of nucleotide excision repair (NER). 
Remarkably, the SmiCI B6 integrase has 98% homology at the 
protein level with the other SmiCI and the SpnCI integrases 
(Figure 7), even though they target separate DNA genes {manA 
and uvrA, respectively). This observation suggests that only a rel- 
atively few amino acid changes were necessary to expand the gene 
repertoire of this CI, creating new possibilities for altering host 
expression patterns. The SmiCI and SpnCI targeting of uvrA, a 
key gene for another essential DNA repair pathway (NER), sug- 
gests that ability to selectively adopt a mutator phenotype in a 
regulated fashion provides a selectable advantage for these cells. 
Based upon the example of SpyCI in S. pyogenes, one could pre- 
dict that the SpnCI and SmiCI will have a cycle of excision and 
re-integration in response to the growth state of the cell or some 
environmental sensor. 
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Intermedius Group 



Mitis/Pneumoniae Group 



FIGURE 4 I Phylogenetic tree of streptococcal chromosomal islands 
integrated into DNA repair genes mutL. Full genome trees are derived. 
In the Pyogenes group, all of which integrate into mutL, the circled 
members of the tree are CI found in the closely related species 
S. dysgalactiae subsp. equisimilis (SeqCI; nos. 9 and 13) and S. canis 
(ScaCI; no. 17). The SmiCI that integrates into manA in the 
Mitis/Pneumoniae group is boxed for identification; the remainder of the 



group integrates into uvrA. The members of the Anginosus and 
Intermedius groups integrate into mutL and have a conserved integrase 
gene; however, the remainder of their chromosomes is divergent, causing 
them to form separate phylogenetic groups. The one exception is that the 
S. intermedius strain F0413 SinCI (#42; underlined) is closely related to the 
SanCI and forms part of that group. The detailed legend for the individual 
members of the tree is presented in Supplemental Table 2. 



INDELS AND CI-MEDIATED PHENOTYPES 

Many members of the streptococcal phage-like CI have notable 
indels (insertions and deletions) in their genomes that may con- 
tribute to the host phenotype (Figure 6). We previously reported 
on the impact of the 128 bp deletion in the SpyCIMS integrase 
gene on S. pyogenes strain Manfredo and the solution the cell 
adopted to prevent permanent silencing of mutL and the down- 
stream genes of the operon (Scott et al., 2012). The SpyCIM53 
element from strain Alab49 (Bessen et al., 2011) is genetically 
quite related to SpyCiMl from strain SF370 but has acquired the 
insertion element IS 1548 (Figures 4, 6). The recently reported 
severe disease associated, epidemic strain MGAS15252 (Fittipaldi 
et al, 2012) contains SpyCIM59.1, a CI with an indel (ebhA) 
predicted to encode extracellular matrix-binding protein with a 
transmembrane domain, which could be a potential virulence 
factor. 

The phage-like CI from the non-pyogenes streptococcal species 
often contain indels that may contribute to host fitness or prevent 



elimination of the CI. A gene encoding an ATPase related to the 
RecG superfamily family of proteins is found in the S. anginosus 
SanCI as well as the phylogenetically separate SmiCI from S. mitis 
and SpnCI from S. pneumoniae (Figures 4, 6). Protein RecG pro- 
vides a means of rescuing stalled replication forks in E. coli (Briggs 
et al., 2004), and these Cl-encoded homologs might offset some 
aspects of the mutator phenotype caused by the integration of the 
CI into mutL (SanCI) or uvrA (SmiCI and SpnCI). Another CI 
indel that frequently appears is a homolog of the KiLA family of 
proteins (found in the SinCI, ScaCI, and SsuCI genomes). The 
kilA gene was identified in the broad-host-range E. coli plasmid 
RK2 and originally detected by the potential lethality of its gene 
product to the host cell (Goncharoff et al., 1991). Based upon 
previously reported criteria (Makarova et al., 2009), a number 
of potential toxin-antitoxin genes are found in the streptococcal 
phage-like CI that may function to prevent elimination of the CI 
from the host during replication and cell division (Figure 6), and 
these KiLA homologs may also contribute to CI maintenance. 
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FIGURE 5 I The MMR operon region from selected streptococcal species. 

The MMR operon in S. pyogenes and related streptococci is composed of a 
core group of genes involved in MMR, Holliday junction resolution and BER 
{mutS, mutL, ruvA, and tag). Indels are frequently found to supplement this 
basic genetic unit: MDR gene ImrP forms an additional part of this core in 
groups A, B, C, and G streptococci. Some indels encode proteins of unknown 
function, but others appear to encode potential virulence factors such as 



pauA (streptokinase) in S. uberis and the collagen-like adhesin in 
S. intermedius. For an extended list of streptococcal species that are 
differentiated by the presence or absence of ImrP, see Supplemental Table 1. 
The core genes of the streptococcal MMR operon are colored orange; MDP 
gene ImrP is magenta, and indels are red. The attachment site for SpyCI in 
S. pyogenes is indicated {attB). Legend for indel genes: glo — glyoxalase 
family protein; slaB — phospholipase A2 SlaB; pauA — streptokinase. 



Other streptococcal phage-like CI indels may improve host 
fitness to environmental challenges. Resistance to cadmium and 
other heavy metals in microbes occurs fi-equently, which inter- 
estingly is often co-selected with antibiotic resistance and may 
contribute to the persistence of resistance genes in the environ- 
ment (Baker-Austin et al., 2006). The S. parauberis SpaCI has 
acquired the cadDX cadmium resistance operon that is related 
to ones found in S. aureus and Streptococcus salivarius (O'Brien 
et al., 2002; Chen et al., 2008). Many heavy metal resistance mech- 
anisms are able to act upon multiple substrates (Baker- Austin 
et al., 2006), and the principal substrate for resistance conferred 
by this SpaCI operon in S. parauberis is unknown. However, since 
cadD encodes a transmembrane protein, its gene product wiU 
alter the surface properties of the host cell, which could not only 
confer resistance but also alter antigenicity and charge of the cell 
membrane. The SpaCI also encodes a second protein with the 
potential for altering the surface of its host: Ltp, a member of a 
family of phage encoded lipoproteins that are involved in super- 
infection exclusion. Proteins of this family have been shown to 
act at the stage of DNA release from the phage head into the cell 
(Neve et al., 1998), thus interfering with the lytic cycle of phages 
that might infect S. parauberis. Other streptococcal phage-like CI 
also carry genes that may protect their host cell from infection 
by invading lytic phages or induction of endogenous prophages 
to the lytic cycle. The SanCI of S. anginosus encode the gene abi, 
which encodes a protein that is a member of a protein family that 
mediates bacteriophage resistance by causing abortive infection 



in Lactococcus species (Anba et al, 1995; Bidnenko et al, 1995). 
The SanCI encoded Abi protein has a predicted helix-turn-helLx 
DNA binding motif, which in these proteins is thought to play 
a role in the interference of the phage life cycle through alter- 
ing the transcriptional program of the virus. In addition to the 
kilA gene discussed above, the S. canis ScaCI contains several 
other indels that may promote host fitness through protection of 
cell metabolism. Located immediately upstream of int is a gene 
encoding a homolog of the DinD DNA damage induced pro- 
tein. In E. coli, the DinD protein inhibits RecA-mediated DNA 
strand exchange, which may limit unwanted homologous recom- 
bination (Uranga et al., 2011). Another ScaCI gene (ybaK) also 
may encode a system to increase host ceU fitness. Members of 
the YbaK protein family have deacylase domains are trans-acting 
amino acid-editing class II prolyl tRNA synthetases, whose pri- 
mary function is to hydrolyze mischarged cysteinyl-tRNA'''^°, thus 
ensuring the fidelity of translation and prevent accumulation of 
mistranslated proteins (Kumar et al., 2012; Das et al, 2014). 

CI AS REGULATORY ELEMENTS OF THE HOST PHENOTYPE 

While the regulation of MMR by SpyCI in S. pyogenes and other 
streptococci is a remarkable evolutionary adaptation, it is one of a 
number of examples of how phages and phage-like mobile genetic 
elements have evolved to form a beneficial relationship with their 
host bacterium. In E. coli, several examples of integrated phage- 
like elements affecting gene sequence or expression have been 
reported. In strain K-12, a P-4-like cryptic prophage controls 
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FIGURE 6 I The CI of S. pyogenes and related species. Tine identification of 
eadi CI is shown to the left of its map, while the integration site {attB) is 
shown to the right of the map (see Tables 1, 2 for identification of CI and attB 
sites). INDELs with identifiable homologous genes are labeled by that 
homolog. Color key: orange — site-specific integrase; dark blue — control of 
lysogeny; light blue — DNA replication; red — maintenance; pink — INDELs; 
green — unknown function; gray — pseudogene. INDEL key: IS1548 — insertion 



element 1548; ebhA — transmembrane surface adhesin; 
recG — recombination protein RecG superfamily; abi — abortive phage 
infection protein; kilA — plasmid maintenance protein; Itp — host cell 
surface-exposed lipoprotein; cadD and cadX — cadmium resistance proteins; 
dinD — DNA damage inducible protein; DUF1211 — domain of unknown 
function; ybaK — prolyl tRNA synthetase. Maps were created using Gene 
Construction Kit (Textco BioSoftware, West Lebanon, NH). 



expression of the AlpA transcriptional regulator by site-specific 
recombination. Overexpression of alpA leads to suppression of 
capsule overproduction and UV sensitivity in cells defective for 
the Lon protease (Kirby et al, 1994; Trempy et al., 1994). The inte- 
gration of the cryptic prophage suppresses alpA expression and 
restores normal capsule production and UV sensitivity. Unlike 
the dynamic cycle of MMR control by SpyCIMl, excision of the 
cryptic prophage leads to the elements loss and a permanent 
conversion of cell phenotype (Kirby et al., 1994). Similarly, inte- 
gration of phage lambda into a secondary attachment site at the 
guaB promoter inhibits the expression of inosine monophosphate 



(IMP) dehydrogenase (Thomas and Drabble, 1986). In this case, 
though, the selective benefit of the resulting inhibition of de novo 
purine biosynthesis following integration is uncertain and may 
be just an unfavorable event that would be unstable over time. 
The integration of Lambdoid phage 21 replaces 165 bp of the 3' 
end of the isocitrate dehydrogenase gene icd to provide an alter- 
native ending (Campbell et al., 1992), which again is an event of 
unknown biological impact that may warrant further study. 

A number of Gram-positive examples of gene control by 
mobile phage-like elements have also been reported. The sigma 
K intervening element (skin) of Bacillus subtilis is a 48 kB 
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***********************************************^ **************** **.******^** ** 

SpnCI Hungary19A-6 MKITEVIKKDGSKVyRANVYLGVDQVTGKKVKTKVTGRTQKEVKQKANQEKIAFQKAGSTRQKAVTIKNYQELATLWLES 80 

SmiCI SK1080 MKITEVIKKDGSKVYRANVYLGVDQVTGKKVKTKVTGRTQKEVKQKATQEKIAFQKAGSTRQKASTIKNYQELANLWLES 80 

SmICI B6 MKITEVIKKDGSKVYRANVYLGVDQVTGKKVKTKVTGRTQKEVKQKANQEKIAFQKAGSTRQKAITIRNYQELATLWWES 30 



*************; ******** ** ****************.****;****; ^ ****** ****************** 

SpnCI Hungary19A-6 SKMTVKPNTQGNVKKLlDNHILPIFGAYKLDKLTTPLIQSIINiLABKTNRGEKAAFLHYDKIHALNKRILQYGVTMQAI 160 

SmiCI SK1080 YKNTVKPNTQGNVRKLIDNHILPIFGAYKLDKLTTPLIQSIINELADKTNRGEKGAFLHYDKIHALNKRILQYGVTMQAI 160 

SmiCI B6 YKNTVKPNTQGNVRALIgNHILPTFGEYKLDKLTTPLIQSIINKLADKANRGgE6AFLg2^IHALNKRILQYGVTMQAI ^gg 



*******************.*.***.***•**** ******. .*****. _***************************** 
SpnCI Hungary19A-6 PSHPARDWLPRNTQKAKRKKVKHFENQELKKFLGYLDNLDTHRYRYYYETTLYKFLLATGCRINEALALSWSDIDLDNA 240 

SmiCI SKia80 psnpardwlprntqkakrqkvkhfenqeikkflgyldnldsdryryyyettlykfllatgcrinealalswsdidldna 240 

SmiCI B6 PSNPARjJWLPRNTQKAKRQKLKHFDNQELKKFLVYLgNLDSCKYRYYYDVTLYKFLLATGgRINEALALSWSglgLgNA 240 



;**;*****; ;********;********* *********************** **********^* ;**;** **** 

SpnCI Hungary19A-6 WHITKTLNRDIEINSPKSKASYRDIDIDQATVSMLKQYKLRQTKEAWKIGQRERWFSDFIHEYPSSSRLKRRLQTHFK 320 
SmiCI SK1080 WHVTKTLNRDLEINSPKSKASYRDIDIDQATVSMLKQYKLRQTKEAWKIGQRESWFSDFIHEYPSSSRLKRRLQTHFK 320 

SmiCI 86 IVHVTKTLNQYLEINSPKSKSSYRDIDIDQETVSMLKQYKLRQTKEAWKIGQRESWFSDFIHEYTSIVKLKKRLLTHFK 329 




*******************************.***** .***** *********************** 

RADVPNIGFHGFRHTHASLLLNSGIPYKELQYRLGHSTLSMTMDIYSHLSKENAKKAVSFYETALKAL 388 

RADVPNIGFHGFRHTHASLLLNSGIPYKELQYRLGHSTLSMTMDIYSHLSKENAKKAVSFYETALKAL 388 

RADVPNIGFHGFRHTHASLLLNSGIPYKELQHRLGHSQISMTMDTYSHLSKENAKKAVSFYETALKAL 333 



SpnCI Hungary19A-6 
SmiCI SK1080 
SmiCI 86 



FIGURE 7 I Alignment of the SmiCI and SpnCI integrases that target 
either manA or uvrA. The integrase proteins encoded by the S. mitis SmiCI 
B6, the S. mitis SmiCI SK1080, and S. pneumoniae Hungaryl9A-6 SpnCI 
were aligned using ClustalX (Jeanmougin et a!., 1998). The integrases from 
SpnCI Hungaryl9A-6 and SmiCI SK1080 both recognize an identical 



attachment site in the promoter of uvrA, while the SmiCI B6 integrase 
recognizes manA. Above the alignment, an asterisk (*) indicates identity 
between the three proteins, a colon (:) indicates a strongly conservative 
amino acid substitution, and a dot (.) indicates a weakly conservative amino 
acid substitution 



prophage-like element that integrates into sigK, separating the 
gene into regions historically known as spoIIIC and spoIVCB 
(Kunkel et al., 1990). Excision of the skin element from the chro- 
mosome leads to reconstitution of the sigK gene, which during 
sporulation encodes the mother-cell-specific factor (Stragier 
et al, 1989). Likewise, a smaller, 14.6 kb element {skin^'^) was 
found to interrupt sigK in Clostridium difficile; unlike in B. subtilis 
where skin may be deleted without major impact upon sporu- 
lation, the C. difficile skin'^'^ is required for efficient completion 
of this event (Haraldsen and Sonenshein, 2003). Interestingly, 
phylogenetic analysis shows that these skin elements arose inde- 
pendently in Bacillus and Clostridium, leading to the speculation 
that this unusual form of sigK regulation may have some specific 
selective advantage in the regulation of sporulation. In Listeria 
monocytogenes, a recent report showed that the DNA uptake 
competence system, considered non-functional, has a temper- 
ate prophage integrated into comK (Rabinovich et al, 2012). 
Surprisingly, the L. monocytogenes Com system promoted bacte- 
rial escape from macrophages, and the regulation of this system 
depended upon the activation of comK following prophage exci- 
sion, which was specifically induced during intracellular growth. 



reminiscent of the activation of SpyCIMl activation at the onset 
of exponential phase (Scott et al, 2008). Recently, a phage-like 
chromosomal island from the genome of Enterococcus faecalis 
V583 has been described and its requirement for a helper phage 
for packaging demonstrated (Mates et al, 2013). The integra- 
tion target site for this element is identified as the promoter of 
a xanthine/uracil permeases family of genes; however, the pre- 
cise function of this gene in E. faecalis or the impact of the 
chromosomal island integration into this gene remains unknown. 

The relation between mobile genetic elements and host gene 
expression is a key component of the biology of S. aureus, which 
in many ways resembles these relationships in S. pyogenes. These 
elements range from typical lambdoid prophages to the SaPI 
phage-like chromosomal islands. S. aureus prophages have been 
demonstrated to mediate gene conversion by controlling the 
expression of lipase, P-Lysin, Staphylokinase, and Enterotoxin A 
(Lee and landolo, 1985; Coleman et al, 1989; Zabicka et al, 
1993), while the SaPI are vectors for the toxic shock syndrome 
toxin (TSST) (Lindsay et al, 1998; Novick et al, 2010). In addi- 
tion to being vectors for TSST, the SaPI carry other genes that 
modify the host phenotype such as a biofilm-associated protein 
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(BAP) (Ubeda et al, 2003) or von Willebrand factor-binding 
protein (Lindsay et al., 1998; Viana et al, 2010). 

In their pioneering paper demonstrating the unexpectedly 
high frequency of the mutator phenotype in wild populations 
of bacteria, LeClerc and co-workers observed that "the ultimate 
pathogen would possess an elevated mutation rate that is tran- 
sient (or conditional), providing genetic variation during the first 
few hours when the pathogen must survive, invade, and colonize 
its host" (LeClerc et al., 1996). The SpyCI of S. pyogenes and the 
similar CI that colonize related species of the genus streptococcus 
may well be an example of a system that fulfills this prediction by 
their unique mechanism of MMR control. The frequent occur- 
rence of MMR defects in natural bacterial populations argues 
that a selective benefit exists in this phenotype, whether it stems 
from increased mutability or the potential for horizontal gene 
transfer (Matic et al, 1995). In most species of bacteria, how- 
ever, the mutator phenotype is fixed and poses a distinct risk to 
the cell in the form of acquiring unwanted mutations that might 
lead to decreased viability. So far, only in the streptococci has 
a system been discovered that allows the cell to switch between 
a mutator and wild type phenotype to presumably achieve a 
balance between costs and benefits. The presence of other phage- 
like CI in the various streptococcal species that potentially target 
other genes for regulation suggests that these elements may be an 
important aspect of the biology of these low G-|-C% bacteria. The 
widespread occurrence by SpyCI and related CI in the pathogenic 
streptococci may be a clue to their importance to the virulence 
and survival of these bacteria, which may prove ultimately to be 
as significant as the carriage of toxigenic bacteriophages. 
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